Realization of Minimum Discursive Units Segmentation of Arab Oral Utterances
Abstract
Unlike the segmentation of written texts, discourse segmentation of Arabic oral dialogues is a challenging task, hampered in most cases by the spontaneous character of speech. Like any segmentation task, segmentation into minimum discursive units (UDM) aims to cut the utterances of a speech into simple propositions that are easy to use in subsequent processing. Most work on the Arabic language has relied on extensive syntactic analysis. In this article, we try to show that hybrid approaches combining linguistic and probabilistic processing are more effective than purely linguistic approaches. We evaluated the performance of our segmentation on a relatively large corpus, which we built using the Wizard of Oz method.
Similar resources
Audio Speech Segmentation Without Language-Specific Knowledge
Speech segmentation is the problem of finding word boundaries in spoken language when the underlying vocabulary is still unknown. Here we show that a system with no phonemic knowledge can find word boundaries. The system first subdivides an utterance by recursively clustering similar parts of the signal together until the cepstral coefficient variance is low within each new segment. These segme...
The Challenges of the Elections Systems of Persian Gulf Arab Countries
This article intends to clarify views regarding important challenges that have originated from the political, social, cultural and geopolitical structures in the elections systems of Persian Gulf Arab countries. Challenges that determine the compatibility levels of elections systems of these countries with the world’s democratic systems. An efficient elections system is the prerequisite for the...
Automatic initial and final segmentation in cleft palate speech of Mandarin speakers
The speech unit segmentation is an important pre-processing step in the analysis of cleft palate speech. In Mandarin, one syllable is composed of two parts: initial and final. In cleft palate speech, the resonance disorders occur at the finals and the voiced initials, while the articulation disorders occur at the unvoiced initials. Thus, the initials and finals are the minimum speech units, whi...
Textuality: The ‘form’ to Be Focused on in SLA
Due to the special (procedural) nature of the language (verbal communication) ‘knowledge’, the dominant trends in applied linguistics research in the last few decades have been advocating ‘acquisition’ rather than ‘learning’ activities where the main focus in SL & FL education should be on ‘meaning’ while some ‘focus-on-form’ being justified. But the ‘form’ to be ‘focused-on’ is mostly misconce...
Automatic Labeling of Corpora for Speech
One of the bottlenecks in the development of text-to-speech synthesizers based on segment concatenation is the need for large, segmented and labeled corpora. Consequently, as manual segmentation and labeling is a tedious and time consuming task, there is a strong demand for automatic labeling systems which can label speech from many languages. Several systems have been proposed already, but the...
Journal title:
- Int. J. Comput. Linguistics Appl.
Volume 7
Publication date: 2016